19 research outputs found
Mining large-scale human mobility data for long-term crime prediction
Traditional crime prediction models based on census data are limited, as they
fail to capture the complexity and dynamics of human activity. With the rise of
ubiquitous computing, there is the opportunity to improve such models with data
that make for better proxies of human presence in cities. In this paper, we
leverage large human mobility data to craft an extensive set of features for
crime prediction, as informed by theories in criminology and urban studies. We
employ averaging and boosting ensemble techniques from machine learning, to
investigate their power in predicting yearly counts for different types of
crimes occurring in New York City at census tract level. Our study shows that
spatial and spatio-temporal features derived from Foursquare venues and
checkins, subway rides, and taxi rides, improve the baseline models relying on
census and POI data. The proposed models achieve absolute R^2 metrics of up to
65% (on a geographical out-of-sample test set) and up to 89% (on a temporal
out-of-sample test set). This proves that, next to the residential population
of an area, the ambient population there is strongly predictive of the area's
crime levels. We deep-dive into the main crime categories, and find that the
predictive gain of the human dynamics features varies across crime types: such
features bring the biggest boost in case of grand larcenies, whereas assaults
are already well predicted by the census features. Furthermore, we identify and
discuss top predictive features for the main crime categories. These results
offer valuable insights for those responsible for urban policy or law
enforcement
Leveraging Mobility Flows from Location Technology Platforms to Test Crime Pattern Theory in Large Cities
Crime has been previously explained by social characteristics of the
residential population and, as stipulated by crime pattern theory, might also
be linked to human movements of non-residential visitors. Yet a full empirical
validation of the latter is lacking. The prime reason is that prior studies are
limited to aggregated statistics of human visitors rather than mobility flows
and, because of that, neglect the temporal dynamics of individual human
movements. As a remedy, we provide the first work which studies the ability of
granular human mobility in describing and predicting crime concentrations at an
hourly scale. For this purpose, we propose the use of data from location
technology platforms. This type of data allows us to trace individual
transitions and, therefore, we succeed in distinguishing different mobility
flows that (i) are incoming or outgoing from a neighborhood, (ii) remain within
it, or (iii) refer to transitions where people only pass through the
neighborhood. Our evaluation infers mobility flows by leveraging an anonymized
dataset from Foursquare that includes almost 14.8 million consecutive check-ins
in three major U.S. cities. According to our empirical results, mobility flows
are significantly and positively linked to crime. These findings advance our
theoretical understanding, as they provide confirmatory evidence for crime
pattern theory. Furthermore, our novel use of digital location services data
proves to be an effective tool for crime forecasting. It also offers
unprecedented granularity when studying the connection between human mobility
and crime
Mining large-scale human mobility data for long-term crime prediction
Abstract Traditional crime prediction models based on census data are limited, as they fail to capture the complexity and dynamics of human activity. With the rise of ubiquitous computing, there is the opportunity to improve such models with data that make for better proxies of human presence in cities. In this paper, we leverage large human mobility data to craft an extensive set of features for crime prediction, as informed by theories in criminology and urban studies. We employ averaging and boosting ensemble techniques from machine learning, to investigate their power in predicting yearly counts for different types of crimes occurring in New York City at census tract level. Our study shows that spatial and spatio-temporal features derived from Foursquare venues and checkins, subway rides, and taxi rides, improve the baseline models relying on census and POI data. The proposed models achieve absolute R2 metrics of up to 65% (on a geographical out-of-sample test set) and up to 89% (on a temporal out-of-sample test set). This proves that, next to the residential population of an area, the ambient population there is strongly predictive of the area’s crime levels. We deep-dive into the main crime categories, and find that the predictive gain of the human dynamics features varies across crime types: such features bring the biggest boost in case of grand larcenies, whereas assaults are already well predicted by the census features. Furthermore, we identify and discuss top predictive features for the main crime categories. These results offer valuable insights for those responsible for urban policy or law enforcement
A Data-Driven Agent-Based Simulation to Predict Crime Patterns in an Urban Environment
Spatial crime simulations contribute to our understanding of the mechanisms that drive crime and can support decision-makers in developing effective crime reduction strategies. Agent-based models that integrate geographical environments to generate crime patterns have emerged in recent years, although data-driven crime simulations are scarce. This article (1) identifies numerous important drivers of crime patterns, (2) collects relevant, openly available data sources to build a GIS-layer with static and dynamic geographical , as well as temporal features relevant to crime, (3) builds a virtual urban environment with these layers, in which individual offender agents navigate, (4) proposes a data-driven decision-making process using machine-learning for the agents to decide whether to engage in criminal activity based on their perception of the environment and, finally, (5) generates fine-grained crime patterns in a simulated urban environment. The novelty of this work lies in the various large-scale data layers, the integration of machine learning at individual agent level to process the data layers, and the high resolution of the resulting predictions. The results show that the spatial, temporal, and interaction layers are all required to predict the top street segments with the highest number of crimes. In addition, the spatial layer is the most informative, which means that spatial data contributes most to predictive performance. Thus, these findings highlight the importance of various open data sources and the potential of theory-informed, data-driven simulations for the purpose of crime prediction. The resulting model is applicable as a predictive tool and as a test platform to support crime reduction.ISSN:0198-9715ISSN:1873-758
MOESM1 of Mining large-scale human mobility data for long-term crime prediction
Supplementary material for: Mining large-scale human mobility data for long-term crime prediction (PDF 2.0 MB
Simulating offender mobility:Modeling activity nodes from large-scale human activity data
In recent years, simulation techniques have been applied to investigate the spatiotemporal dynamics of crime. Researchers have instantiated mobile offenders in agent-based simulations for theory testing, experimenting with crime prevention strategies, and exploring crime prediction techniques, despite facing challenges due to the complex dynamics of crime and the lack of detailed information about offender mobility. This paper presents a simulation model to explore offender mobility, focusing on the interplay between the agent's awareness space and activity nodes. The simulation generates patterns of individual mobility aiming to cumulatively match crime patterns. To instantiate a realistic urban environment, we use open data to simulate the urban structure, location-based social networks data to represent activity nodes as a proxy for human activity, and taxi trip data as a proxy for human movement between regions of the city. We analyze and systematically compare 35 different mobility strategies and demonstrate the benefits of using large-scale human activity data to simulate offender mobility. The strategies combining taxi trip data or historic crime data with popular activity nodes perform best compared to other strategies, especially for robbery. Our approach provides a basis for building agent-based crime simulations that infer offender mobility in urban areas from real-world data